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INTRODUCTION 


Plants  have  been  tested  for  use  as  sensors  for  environmental  pollution 
because  they  are  inexpensive  to  produce  and  maintain  and  are  in  direct  contact 
with  the  environment.  Most  bioassays  measure  only  the  harmful  effects  of 
pollutants  on  plants  by  scoring  survival  rates  or  visible  indicators  such  as  leaf 
damage  and  growth  inhibition.  However,  an  emerging  approach  to 
environmental  bioassays  is  to  detect  changes  in  gene  expression  as  a  result  of 
pollution.  Assays  based  on  such  genetic  responses  have  been  largely  limited  to 
detection  of  bacterial  genes  encoding  pollutant  degradation,  or  uncharacterized 
stress-responsive  genes  (Blom,  et  al.,  1993;  Erb  and  Wagner-Dobler,  1993;  Van 
Dyk,  et  al.,  1994).  Plants  also  modify  their  gene  expression  in  response  to 
environmental  changes,  activating  different  metabolic  pathways  to  protect  cells 
and  tissues  from  damage  while  repressing  others.  By  monitoring  expression  of 
genes  that  respond  to  specific  pollutants,  it  may  be  possible  to  detect  these 
environmental  contaminants  indirectly,  using  plants  as  environmental 
monitors. 

Such  a  molecular  bioassay  depends  on  the  discovery  of  specific  probes  for 
genes  that  respond  to  particular  environmental  factors.  Datura  innoxia  and 
certain  other  plant  species  tolerate  moderate  amounts  of  2,4,6-trinitrotoluene 
(TNT)  in  soil  and  cell  culture  media.  These  plants  accumulate  and  metabolize 
TNT.  The  basis  of  tolerance  may  include  an  inducible  change  in  the  pattern  of 
gene  expression  produced  by  the  plant.  By  monitoring  these  changes,  one  may 
discern  a  specific  "mRNA  or  protein  signature"  that  is  unique  to  TNT  exposure 
Identification  of  such  a  pattern  (and  associated  molecular  probes  to  detect  this) 
should  allow  development  of  strategies  that  use  plants  as  "environmental 
sentries"  to  monitor  TNT  soil  contamination  and  migration. 

A  molecular  bioassay  depends  on  the  discovery  of  specific  probes  for 
response  to  particular  environmental  factors.  Arbitrarily  primed  PCR  RNA 
fingerprinting  (RAP),  as  described  by  Welsh,  et  al.  (1992),  was  used  to  identify  and 
clone  cDNA  fragments  encoding  messenger  RNA’s  whose  concentrations 
increase  or  decrease  with  TNT  exposure.  Amplifying  different  arbitrary  subsets 
of  these  mRNA’s  displayed  these  profiles  in  an  easily  interpreted  gel 
electrophoresis  pattern.  While  the  majority  of  cDNA  products  were  unchanged  by 
changing  TNT  concentrations  and  exposure  times,  a  small  subset  showed 
changes  that  were  TNT  dose  or  time  dependent.  DNA  fragments  specifically 
produced  in  reactions  containing  mRNA  template  from  TNT-treated  cells  were 
excised,  and  the  cDNA  cloned  and  sequenced.  Searches  for  homologous 
sequences  in  nucleic  acid  and  protein  sequence  databases  demonstrated  that 
three  cDNA  isolates  have  significant  homology  to  previously  described  protein  or 
cDNA  sequences.  The  remaining  clones  appear  to  represent  previously  unknown 
genes  whose  expression  may  respond  specifically  to  TNT  exposure.  Molecular 
probes  for  these  genes  have  been  developed  to  further  study  kinetics  of  TNT- 
induced  gene  expression.  Probes  developed  to  detect  changes  in  gene  expression 
in  plant  cell  cultures  exposed  to  TNT  are  being  tested  to  determine  whether  the 
same  genes  are  similarly  regulated  in  intact  plants.  A  collaboration  with  an 
independent  private  company  is  being  developed  to  determine  whether  this 
approach  can  be  used  to  monitor  changing  TNT  concentrations  in  a  constructed 
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ecosystem  that  uses  different  plants  to  remove  and  metabolize  TNT  from 
contaminated  soils  and  effluents. 

EXPERIMENTAL  METHODS 

Cell  culture  growth.  Suspension  cultures  of  Datura  innoxia  cells  were  grown  as 
previously  described  (Jackson,  et  al.,  1984).  Cultures  in  250  mL  baffled  flasks 
were  diluted  4-fold  every  2  days  by  adding  12.5  mL  of  suspension  to  37.5  mL  of 
fresh  medium.  Suspensions  were  aerated  by  shaking  (120  rpm)  at  30°C  in  the 
dark.  Stock  cultures  were  expanded  to  provide  cells  for  experiments  and  were 
allowed  to  grow  for  24  hours  after  passage  before  exposure  to  the  inducing  agent. 
TNT  Exposure.  A  110  mM  stock  solution  of  TNT  in  ethyl  alcohol  was  prepared 
and  stored  in  the  dark  at  4°C.  This  was  introduced  to  final  concentrations  of  44, 
110,  220  and  440  pM  TNT  into  cultures  diluted  in  fresh  growth  medium  24  hours 
previously.  Previous  experiments  showed  that  44  pM  TNT  had  no  effect  on 
long-term  cell  growth  rates,  while  110  and  220  pM  TNT  significantly  inhibited 
growth  and  440  pM  TNT  caused  death  within  48  hours.  Control  cultures 
containing  no  additives  or  the  same  ethyl  alcohol  concentration  as  cultures 
containing  440  pM  TNT  were  also  grown  and  harvested.  Cultures  were  incubated 
and  shaken  for  2,  8  and  13  hours  following  TNT  addition  before  portions  of  the  cell 
suspensions  were  harvested  for  RNA  isolation. 

RNA  Isolation.  Cell  suspensions  (5  mL)  were  removed  from  medium  by  gravity 
filtration  and  frozen  in  liquid  nitrogen.  Frozen  cells  were  ground  to  a  fine  powder 
with  a  sterile  liquid  nitrogen-chilled  mortar  and  pestle,  and  the  RNA  was  isolated 
using  the  procedure  described  by  Chomczynski  and  Sacchi  (1987).  Reprecipitated 
RNA  was  dissolved  in  sterile  water  and  used  immediately  for  first  strand 
synthesis  or  stored  at  -70°C. 

Polv(A)+  RNA  Selection.  For  some  samples,  the  poly(A)+  RNA  fraction  was 
isolated  from  cleared  lysates  without  prior  RNA  purification.  Messenger  RNA 
was  isolated  using  a  commercially  available  kit  (PolyAttract,  Promega 
Corporation,  Madison,  WI)  or  oligo(dT)  cellulose.  The  latter  method  was  based  on 
the  Quick  Prep  kit  (Pharmacia  Biotech,  Inc.,  Alameda,  CA).  Purified  poly(A)+ 
RNA  was  used  in  first  strand  synthesis  reactions  immediately  after 
spectrophotometric  quantification. 

First  Strand  Synthesis  with  Decamer  Primers.  First  strand  cDNA  syntheses 
were  performed  in  sterile  "V"-bottom  96-well  microtiter  plates  (Nunc,  Inc.).  For 
each  25  pi  reaction,  200  ng  of  total  RNA,  or  50  ng  of  poly(A)+  RNA  were  heated  to 
65°C  for  5  minutes,  then  chilled  on  ice.  Reaction  mixes  contained  RNA,  1  pM 
10-nt  arbitrary  primer  (Table  1),  100  pM  each  dATP,  dTTP,  dCTP  and  dGTP,  50 
mM  Tris-HCl  pH  8.3,  75  mM  KC1,  3  mM  MgCl2  and  200  U  M-MLV  Reverse 
Transcriptase  (BRL-Life  Technologies,  Inc.).  Reaction  wells  were  sealed  with 
vinyl  tape  to  reduce  evaporative  loss  and  the  plate  was  incubated  for  1  hour  at 
37°C.  Reactions  were  further  incubated  at  65°C  for  15  minutes  to  inactivate  the 
enzyme.  The  resultant  first  strand  cDNA  was  stored  at  4°C  until  needed. 
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Table  1.  Oligonucleotide  Primer  Sequences 


Primer 

Sequence 

Primer 

Sequence 

AP-21 

CGA  GTA  TGA  G 

AP-26 

GAA  GCT  GAG  G 

AP-22 

TTG  ATA  CGA  C 

AP-27 

TCC  ATA  TCG  A 

AP-23 

GCT  CAT  CAG  G 

AP-28 

CGG  AGT  CTG  T 

AP-24 

CTC  CTG  TAG  A 

AP-29 

GGA  ATG  CCA  T 

AP-25 

GAT  TAT  CGC  C 

AP-30 

GAG  CAC  GAC  T 

Second  Strand  Synthesis  and  PCR  Amplification.  Five  mL  of  each  first  strand 
reaction  mix  (equivalent  to  40  ng  of  beginning  total  RNA)  was  added  to  a  25  mL 
PCR  reaction  mix  (10  mM  Tris-HCl  pH  8.3,  50  mM  KC1,  1.5  mM  MgCl2) 
containing  200  |iM  of  each  dATP,  dTTP,  dCTP  and  dGTP  and  1  pM  additional 
DNA  oligomer.  The  second  strand  mixes  were  capped  with  25  mL  melted 
paraffin  wax  and  2  units  of  Taq  polymerase  (AmpliTaq,  Perkin-Elmer  Corp.)  were 
placed  on  top  of  the  wax  in  5  pi  of  1  x  PCR  buffer.  Second  strand  reactions  were 
incubated  using  a  "hot  start"  protocol  (Bassam  and  Caetano-Annoles,  1993)  in  a 
Perkin-Elmer  9600  thermocycler  by  heating  at  94°C  for  5  minutes,  40°C  for  5 
minutes,  then  72°C  for  5  minutes.  This  was  followed  by  40  amplification  cycles  of 
94°C  for  30  seconds,  40°C  for  2  minutes,  then  72°C  for  30  seconds  (Bauer,  et  al., 
1993).  After  a  final  extension  at  72°C  for  5  minutes,  the  samples  were  stored  at 
4°C  until  analyzed  by  gel  electrophoresis. 

Polyacrylamide  Gel  Electrophoresis  and  Silver  Staining.  Amplified  DNA  was 
separated  on  a  nondenaturing  TBE-6%  polyacrylamide  gel  (Protean  II,  Bio-Rad 
Laboratories,  Cambridge,  MA)  to  resolve  fragments  between  0.2  and  1.0  kb. 
Fragments  were  detected  by  silver  staining  (Bassam,  et  al.,  1991).  Stained  gels 
were  preserved  by  drying  on  a  Bio-Rad  Model  583  gel  dryer  at  80°C  for  2  hours 
under  vacuum. 

DNA  Recovery  and  Re-Amplification.  DNA  fragments  that  showed  increased  or 
decreased  intensity  following  TNT  exposure  were  excised  from  dried,  silver 
stained  gels  and  placed  in  100  mL  of  PCR  reaction  mix  (10  mM  Tris-HCl  pH  8.3,  10 
mM  KC1,  4  mM  MgCl2,  200  pM  each  dATP,  dTTP,  dCTP  and  dGTP,  1  pM  of  the 
same  arbitrary  primer  used  to  generate  the  original  fragment,  and  10  U  of  Taq 
polymerase  Stoffel  fragment  (Perkin-Elmer  Corp.).  DNA  was  re-amplified  from 
the  dried  gel,  with  no  further  purification  (Weaver,  et  al.,  1994)  by  40  cycles  of  94°C 
for  30  seconds,  40°C  for  2  minutes,  then  72°C  for  30  seconds.  After  a  final 
extension  at  72°C  for  5  minutes,  the  reactions  were  stored  at  4°C.  Polyacrylamide 
gel  electrophoresis  of  the  reaction  products  showed  DNA  migrating  at  the  same 
rate  as  the  original  fragment,  although  the  re-amplified  fragments  were 
sometimes  accompanied  by  as  many  as  two  additional  fragments  with  slightly 
different  molecular  weights. 

Cloning  and  Sequencing.  PCR-amplified  DNA  was  ligated  to  pGEM-T  vector 
(Promega  Corp.)  without  prior  purification  from  the  PCR  reaction  mix.  PCR 
reaction  mix  (10  pi)  was  ligated  in  a  15  pi  ligation  reaction  mix  (30  mM  Tris-HCl 
pH  7.6,  10  mM  MgCl2,  10  mM  DTT,  0.5  mM  ATP)  to  5.3  ng  of  pGEM-T  by  addition 
of  1  Weiss  unit  of  T4  DNA  Ligase  (Promega  Corp.).  This  was  incubated  at  15°C 
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overnight.  The  entire  15  pi  ligation  reaction  was  used  to  transform  competent 
DH5a  cells,  using  a  5  minute  37°C  heat  shock  and  1.5  hour  pre-incubation  in  LB 
broth  before  plating  the  entire  reaction  onto  LB-agar  plates  containing  50  jig/m L 
ampicillin,  40  (ig/mL  5-bromo-4-chloro-3-indolyl-(3-D-galactoside  (X-gal)  and  200 

jxg/mL  isopropylthio-(3-D-galactoside  (IPTG). 

Overnight  cultures  were  diluted  1:200  into  10  mL  of  LB  broth  containing  50 
pg/mL  ampicillin,  then  grown  to  an  OD600  of  1.5  -  1.8.  Glycerol  (150  mL)  was 
added  to  1  mL  of  the  liquid  culture  and  the  stock  was  frozen  and  stored  at  -20°C. 
Plasmid  DNA  was  isolated  from  the  remainder  using  the  Qiagen  Tip-20  alkaline 
lysis  protocol  provided  by  the  company  (Qiagen,  Inc.,  Chatsworth,  CA).  Purified 
DNA  was  sequenced  by  the  Sanger  dideoxy  method  using  dye-terminators 
(Dye-Deoxy,  Applied  Biosystems,  Inc.,  Foster  City,  CA)  in  an  Applied  Biosystems 
model  373A  automated  sequencer.  The  M13  forward  universal  primer 
(5'-GTAAAACGACGGCCAGT-3')  and  reverse  primer 

(5'-AACAGCTATGACCATG-3')  were  used  to  sequence  both  strands. 

Northern  Blotting  or  RNAse  Protection  Assays.  DNA  sequences  were  used  to 
design  DNA  oligomers  used  as  PCR  primers  to  amplify  genes  encoding  the 
expressed  mRNA  sequences.  The  resulting  amplicons  were  ligated  directly  to 
pGEM-T  DNA  (Promega,  Inc.)  as  previously  described.  The  ligation  results  were 
used  to  transform  DH5a  cells  and  white  colonies  picked  following  ampicillin, 
IPTG,  X-gal  selection.  Cloned  inserts  were  sequenced  and  the  sequences 
compared  to  previous  cDNA  sequences.  Labeled  DNA,  used  to  probe  Northern 
blots,  was  generated  by  PCR  using  primers  specific  to  the  SP6  and  T7  promoter 
sequences  present  in  the  vector  DNA.  Hybridization  was  conducted  using 
standard  methods. 

Sequence  Database  Searches.  DNA  sequences,  minus  any  vector  and  PCR  primer 
sequences,  were  used  to  search  for  sequences  of  significant  homology  in  the 
GenBank  database  using  the  GCG  Wordsearch  module  (Devereux,  et  al.,  1984). 
The  basic  local  alignment  search  tool  (BLAST)  was  also  used  to  search  sequence 
databases  (Altschul,  et  al.  1990).  The  National  Library  of  Medicine  electronic  mail 
server  (blast@ncbi.nlm.nih.gov)  was  used  for  these  searches,  in  which  cDNA 
sequences  were  sent  via  E-mail  for  nucleotide  sequence  comparisons  (BLASTN 
program)  with  the  non-redundant  sequence  database  compiled  from  GenBank, 
GenBank  updates,  EMBL,  and  EMBL  updates.  The  cDNA  sequences  were  also 
compared  with  protein  sequence  databases  by  using  the  BLASTX  program.  Only 
default  parameters  were  used,  making  the  searches  relatively  non-selective,  but 
only  statistically  significant  matches  (P  =  1  x  10’4)  were  investigated. 

Plant  Growth.  Plant  tissue  samples  will  also  be  monitored  for  content  of  TNT  and 
its  metabolic  derivatives  and  for  TNT  induced  genes  using  the  probes  described 
above.  This  information  will  be  useful  to  determine  what  portion  of  TNT  removal 
and  biodegradation  is  accomplished  by  the  vegetative  portion  of  the  ecosystem  and 
what  portion  is  accomplished  by  microbial  activity.  Each  system  ecology  will  be 
monitored  for  TNT  depletion,  with  the  expectation  that  TNT  concentrations  will 
drop  as  liquid  moves  through  the  system.  We  expect  that  TNT  concentrations  will 
drop  at  the  same  rate  that  other  parameters  such  as  total  nitrogen  or  chemical 
oxygen  demand  declined  in  previous  pilot  studies.  Basic  operational  parameters 
such  as  water  temperature,  pH,  ORP,  flow,  greenhouse  temperature,  ambient  air 
temperature,  pump  cycling  and  aeration  cycling  will  be  continuously  monitored 
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2  hours  8  hours  13  hours 

Figure  1.  An  Example  of  Time  and  TNT  Dose-Responsive  cDNA. 

Differential  display  PCR  reactions  were  performed  using  RNA  from  cells 
exposed  to  TNT  for  2,  8,  and  13  hours.  Primer  S'-CTCCTGTAGA-S'  was  used 
in  first  and  second  strand  reactions.  Within  each  time  point  cultures  were 
treated  with  (1)  No  change,  (2)  1.6  ml/ml  ethyl  alcohol,  (3)  44  pM  TNT,  (4) 
110  pM  TNT,  (5)  220  pM  TNT  and  (6)  440  pM  TNT.  Electrophoresis  was  at 
200  V  for  14  hours  through  a  6%  TBE-polyacrylamide  gel.  Arrow  shows  a 
290  bp  amplified  cDNA  fragment  (AP24.1)  that  demonstrates  both  dose  and 
time  dependent  expression. 
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using  a  computer  for  data  collection.  All  other  samples  will  be  collected  on  a 
weekly  basis  as  grab  samples  and  analyzed  for  remaining  parameters  of  interest. 
If  indications  of  stress  in  different  portions  of  the  system  become  apparent, 
monitoring  frequency  will  be  increased.  This  work  is  in  progress.  Each  of  six 
different  treatment  plant  ecologies  is  being  established.  These  will  be  monitored 
for  bacterial  activity  using  ORP  probes,  by  selecting  samples  of  the  liquid  to  be 
analyzed  for  bacterial  content  (FAME  analysis),  by  performing  PCR  analysis  to 
study  the  distribution  of  non-cultural  microbes  and  fungi  present,  and  by 
performing  plate  counts  on  selected  media.  This  will  provide  immediate 
information  about  changes  in  microbial  content  upon  introduction  of  the  septage 
carrier  and  munitions-containing  waste  stream.  Microbial  sub-populations  are 
expected  to  change  relative  to  one  another.  However  absolute  quantities  of 
microbes  are  expected  to  remain  approximately  the  same. 


RESULTS 

Classes  of  Differential  Display  Patterns.  A  representative  example  of  TNT- 
induced  gene  expression  identified  using  the  methods  outlined  is  shown  in  Figure 
1.  More  than  20  different  DNA  fragments  were  identified  as  candidates  for 
TNT-responsive  cDNAs  using  this  method.  Of  these,  three  showed  TNT 
dose-dependent  induction.  Two  other  fragments  showed  time  and  dose 
dependence.  These  were  of  particular  interest  because  the  relationship  between 
their  expression  and  TNT  exposure  suggested  that  they  might  encode 
TNT-specific  response  genes.  Six  fragments  were  present  only  at  the  highest  TNT 
concentration.  Several  additional  fragments  decreased  or  disappeared  upon  TNT 
exposure.  Finally,  four  fragments  were  picked  from  the  poly(A)+  differential 
display  experiment  because  they  appeared  only  when  RNA  was  isolated  from  cells 
that  were  continuously  grown  in  44  pM  TNT,  and  were  not  amplified  from  RNA 
isolated  from  untreated  cells. 

When  the  DNA  fragments  were  isolated  and  re-amplified  then  examined  by 
electrophoresis,  two  DNA  fragments  were  present  in  some  reactions.  One 
migrated  slightly  faster  than  expected.  An  attempt  was  made  to  re-amplify  and 
clone  the  reamplification  products  of  both  bands.  However,  this  was  often 
unsuccessful.  Fragments  were  ligated  to  plasmid  pGEM-T  and  ligation  products 

were  transformed  into  E.  coli  (strain  DH5a)  and  plated. 

All  the  sequenced  inserts  contained  complete  primer  sequences  at  their 
termini,  confirming  their  origin  as  full  length  PCR  products.  However,  when  two 
clones  from  the  same  re-amplified  DNA  fragment  were  sequenced,  the  length  and 
sequence  of  one  clone  differed  from  the  other.  This  is  consistent  with  the 
observation  of  several  polymorphic  DNA  fragments  present  in  a  single  amplified 
DNA  sample  (Bauer,  et  al.,  1993.).  Northern  blots  containing  mRNA  isolated 
from  cells  exposed  to  different  TNT  concentrations  were  therefore  hybridized  to 
probes  to  determine  which  of  the  cloned  sequences  is  representative  of  a  TNT- 
modulated  gene.  One  of  the  two  probes  hybridized  to  a  gene  that  was  constitutively 
expressed  while  the  other  hybridized  to  mRNA  present  only  in  cells  exposed  to 
TNT. 

Computer  Database  Searches.  When  DNA  sequences  of  the  cDNA  fragments 
were  compared  to  the  GenBank  database  using  the  GCG  Wordsearch  program 
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(Devereux,  et  al.,  1984),  two  clones  showed  significant  homology  to  a  previously 
described  sequence,  while  others  represented  novel  mRNA  sequences.  The 
identified  sequences,  from  fragments  AP26.2  (clone  M)  and  AP26.4  (clone  Q) 
share  71%  nucleotide  identity  with  a  portion  of  the  3'  untranslated  region  of  a 
tobacco  ( Nicotiana  silvestris )  ubiquitin  cDNA  clone  (Genschik,  et  al.,  1992). 
AP26.2  and  AP26.4  were  generated  from  RNA  extracted  from  cells  that  had  been 
growing  continuously  in  44  pM  TNT.  The  sequences  and  lengths  of  these  two 
clones  were  identical,  although  they  were  originally  cloned  from  two  different 
DNA  fragments.  It  is  possible  that  one  clone  is  a  contaminating  DNA  from 
another  fragment.  If  so,  then  a  single  band  on  a  polyacrylamide  gel  may  contain 
length,  as  well  as  sequence  polymorphisms.  Since  DNA  was  separated  on  native 
gels,  it  is  possible  that  fragments  of  different  sizes  might  migrate  together  based 
on  secondary  structures  of  the  molecules.  Because  these  clones  are  identical,  they 
were  regarded  as  a  single  cDNA  isolate. 

DNA  sequence  comparisons  using  the  BLASTN  program  revealed  no 
significant  nucleotide  sequence  matches  other  than  tobacco  ubiquitin.  BLAST 
expresses  the  statistical  significance  of  a  match  with  a  P-value,  such  that  a 
P-value  close  to  one  indicates  a  high  probability  that  such  a  match  could  occur  by 
chance  alone.  Nucleic  acid  sequence  matches  for  tobacco  ubiquitin  (clones  AP26.2 
and  AP26.4)  had  P-values  of  1.1  x  10‘6.  Ubiquitin  cDNAs  originating  from  other 
species  made  less  significant  matches,  with  Arabadopsis  thaliana  (P  =  0.38),  rice 
(P  =  0.77),  and  Brassica  rapa  (P  =  0.98).  Other  cDNA  clones,  with  relatively  high 
P-values  (P  >  1  x  10'4)  were  not  considered  homologous  to  known  DNA  sequences. 

BLASTX  translates  both  strands  of  a  nucleotide  sequence  in  all  three 
reading  frames,  then  compares  the  peptide  sequences  with  protein  sequence 
databases.  [Clone  AP29.3  (clone  I)  produced  a  polypeptide  sequence  from  the  plus 
strand  (+1)  reading  frame  that  is  rich  in  serine,  and  was  similar  to  the  serine  and 
threonine-rich  transcription  activation  (STA)  domains  (Theill,  et  al.  1989)  of  a 
variety  of  nuclear  zinc-finger  proteins  such  as  human  AF-9  (Nakamura,  et  al. 
1993),  a  leukemia-associated  protein  (P  =  1.6  x  10'9),  and  Drosophila 
ecdysone-inducible  proteins  E75A,  B,  and  C  (Seagraves  and  Hogness,  1990;  Feigl, 
et  al.  1989)  (P  =  1.4  x  10'5)  that  are  products  of  alternative  mRNA  splicing  from  the 
same  transcript.  In  addition,  a  serine-rich  domain  is  found  in  the  engrailed-\ike 
protein  smox- 2  of  the  parasitic  flatworm  Schistosoma  masoni  (Webster  and 
Mansour,  1992)  (P  =  5.8  x  10‘5).  The  plus  strand  (+3)  reading  frame  of  clone  AP26.3 
(clone  O)  encodes  a  peptide  sequence  that  was  similar  to  the  extracellular  dermal 
glycoprotein  (EDGP)  of  carrots  (P  =  2.0  x  10"16)  (Satoh,  et  al.  1992)  and  to  soybean 
basic  7s  globulin  (P  =  1.7  x  10'5)  (Kagawa  and  Hirano,  1989).  The  EGDP  gene  is 
transcriptionally  activated  in  response  to  mechanical  wounding  and  pathogen 
invasion.  The  basic  7s  globulin  mRNA  is  induced  in  soybeans  exposed  to  hot 
water  for  several  hours,  and  is  closely  related  to  EDGP.  Although  EDGP  mRNA 
is  normally  abundant  in  plant  suspension  cultures,  AP26.3  is  clearly  induced  in 
TNT-exposed  cells.  These  proteins  are  both  associated  with  wounding,  pathogen 
invasion,  and  other  forms  of  stress,  so  AP26.3  may  be  another  general 
stress-induced  cDNA,  like  the  ubiquitin  homologues  previously  described. 

The  recently  initiated  whole  plant  studies  take  advantage  of  a  system 
developed  by  The  Southwest  Wetlands  Group,  Inc.  of  Santa  Fe,  NM.  They  have 
demonstrated  that  a  combination  of  different  ecologies  can  be  used  to  treat 
municipal  and  residential  waste  water,  landfill  leachates,  industrial  wastewater, 
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acid  mine  drainage,  and  septage.  The  unique  approach  suggested  by  SWG 
incorporates  septage  as  a  carrier  media  and  uses  several  different  ecologies  -- 
aquatic,  wetlands,  and  meadow  --  that  rely  on  the  associated  microbial,  fungal, 
algal  and  plant  communities  to  function  as  consortia  in  processes  that  convert 
TNT  to  benign  nitrogenous  compounds.  The  use  of  these  complex  self-regulating 
ecologies  makes  this  biotechnology  inherently  more  effective  and  efficient.  The 
ability  to  monitor  in  situ  TNT  concentrations  indirectly  by  monitoring  gene 
expression  should  provide  a  method  to  finely  regulate  the  treatment  process  by 
monitoring  TNT-modulated  gene  expression  in  plants  located  at  different  points 
within  the  treatment  cycle.  Completion  of  such  experiments  requires 
establishment  of  the  different  constructed  ecologies.  Ongoing  establishment  of  the 
appropriate  plant  species  is  accompanied  by  use  of  D.  innoxia  probes  to  determine 
whether  TNT-modulated  genes  from  this  species  are  sufficiently  similar  in  other 
species  to  allow  monitoring  using  the  D.  innoxia  probes. 

DISCUSSION 

The  gradual  onset  of  chronic  stress  in  plants  can  result  in  adaptation. 
Persistent  changes  occur  that  allow  an  adapted  plant  to  endure  conditions  that 
would  have  killed  it,  had  the  stress  been  introduced  abruptly.  When  challenged 
with  an  acute  stress,  many  plants  halt  translation  of  mRNAs  that  are  not  related 
to  repair  or  adaptation  (Gallie,  1993).  They  do  this  by  selective  mRNA  degradation 
or  by  sequestering  transcripts  into  intracellular  compartments.  If  the  source  of 
the  stress  is  not  removed,  even  the  sequestered  transcripts  are  degraded. 
Moreover,  many  genes  are  transcriptionally  arrested  by  environmental  stress.  In 
this  way,  the  condition  of  a  plant's  environment  may  be  reflected  in  its  pattern  of 
gene  expression. 

While  transpiring  water  out  of  soil,  plants  continuously  monitor 
groundwater  and  soil  quality  over  a  large  sample  area.  If  there  are  toxic 
substances  in  the  soil,  plants  eventually  give  evidence  of  their  presence  by 
external  signs  of  stress.  However  appearance  of  these  changes  is  not  specific  to 
any  particular  compound.  A  method  is  needed  that  detects  changes  in  plants 
resulting  from  the  presence  of  a  particular  compound,  TNT,  at  concentrations 
that  are  lower  than  those  that  cause  obvious  systemic  damage. 

PCR  amplified  cDNA  fragments  generated  from  D.  innoxia  RNA  templates 
with  expression  patterns  suggesting  a  role  in  the  plant  cell  response  to  TNT  were 
identified.  The  DDRT-PCR  technique  is  a  powerful  and  practical  method  for 
discovering  novel  mRNAs  that  may  serve  as  genetic  indicators  for  TNT  exposure. 
By  simultaneously  viewing  the  effect  of  different  TNT  concentrations  at  different 
time  points,  it  was  possible  to  distinguish  TNT-dependent  responses  and  general 
stress  responses  from  cell  growth-dependent  responses  within  a  subset  of  cDNA 
fragments. 

Ten  different  arbitrary  decamers  were  used  as  primers  to  select  over  20 
candidate  DNA  fragments  that  showed  patterns  of  expression  suggestive  of 
TNT-dependent  transcriptional  modulation.  These  were  cloned  and  sequences 
were  determined  for  ten  of  these  clones.  One  cDNA  isolate,  cloned  from  both 
AP26.2  and  AP2.4,  showed  significant  similarity  to  a  known  gene,  with  71% 
sequence  identity  to  the  3'-untranslated  region  of  the  tobacco  ubiquitin  mRNA. 
Although  the  coding  regions  of  ubiquitin  genes  are  conserved,  there  is  normally 
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great  diversity  in  the  untranslated  regions  (UTR).  The  similarity  between  the 
AP26.4  clone  and  tobacco  ubiquitin  may  imply  some  shared  function  that  is 
encoded  in  this  region.  Two  functions  that  are  determined  by  3'-UTR  elements 
are  intracellular  localization  of  the  mRNA  (Hesketh,  et  al.,  1994)  and  mRNA 
stability  (Green,  1993).  Ubiquitin  is  a  highly  conserved  gene  found  in  all 
eukaryotes.  It  is  a  component  of  a  system  in  which  damaged  or  short-lived 
proteins  are  poly-ubiquitinated  and  thus  marked  for  degradation  (Cheng,  et  al., 
1994).  Such  a  gene  product  might  be  expected  to  appear  under  highly  stressful 
culture  conditions.  Another  isolate,  AP26.3,  encodes  a  polypeptide  that  is  similar 
to  the  wound-  and  pathogen-inducible  extracellular  dermal  glycoprotein  (EDGP) 
of  carrot  ( Daucus  carota ).  These  two  isolates  demonstrate  the  validity  of  the 
DDRT-PCR  technique  to  identify  and  isolate  probes  for  stress-induced  genes. 

The  appearance  of  ubiquitin  and  EDGP  among  the  differential  display 
clones  demonstrates  that  this  approach  will  identify  genes  that  respond  to  TNT 
exposure.  However,  it  also  shows  a  fundamental  obstacle  to  finding  a 
TNT-specific  genetic  response.  Many  genes  will  change  expression  in  response  to 
stress  of  any  kind.  These  generalized  responses  may  have  roles  in  repairing 
damage  to  the  cell,  such  as  scavenging  damaged  proteins,  repairing  DNA 
damage  and  arresting  the  cell  cycle.  Such  transcripts  can  be  expected  to  appear 
at  levels  of  stress  that  can  also  be  discerned  through  changes  in  growth  rate, 
tissue  damage,  or  other  external  pathological  indicators.  However,  other  genes 
will  be  quite  specific  for  particular  types  of  stress  and  may  even  be  peculiar  to 
certain  chemicals.  These  may  owe  their  specificity  to  encoding  components 
responsible  for  catabolism  of  the  chemical.  Such  genes  might  be  the  most 
sensitive  indicators  of  particular  pollutants,  since  they  would  appear  at  very  low 
pollutant  concentrations  when  widespread  cell  or  tissue  damage  had  not  yet 
occurred.  In  the  DDRT-PCR  experiments  performed  here,  such  transcripts 
might  appear  at  TNT  concentrations  that  do  not  interfere  with  long  term  cell 
growth,  but  increase  with  exposure  to  higher  TNT  concentrations.  Some  of  these 
transcripts  may  also  be  temporally  down-regulated,  as  TNT  is  absorbed  and 
degraded  in  the  cell.  Further  screening  to  demonstrate  TNT-specific  gene 
expression  modulation  is  needed  to  determine  whether  those  genes  already 
identified  are  modulated  only  by  TNT  exposure  or  whether  their  response  is  part 
of  a  broader  generalized  environmental  stress  response  in  the  plant. 

Induction  of  a  catabolic  pathway  through  increased  mRNA  abundance 
implies  the  existence  of  TNT-modulated  regulators  of  gene  expression.  These 
regulatory  proteins  might  contain  DNA-binding  and  transcription-activating 
domains  that  can  be  identified  by  their  sequence  homology  to  known  domains  in 
sequence  databases.  Clone  AP29.3  can  encode  a  serine-  and  threonine-rich 
polypeptide  that  is  similar  to  the  transcription  activation  domains  of  the  zinc 
finger  proteins  AF-9  (Nakamura,  et  al.  1993),  E75A  (Seagrave  and  Hogness,  1990) 
and  sox- 2  (Webster  and  Mansour,  1992).  Transcription  activating  proteins  might 
induce  the  coordinate  expression  of  a  variety  of  genes  responsible  for  different 
aspects  of  cell  protection  and  repair. 

When  selected  DNA  fragments  were  isolated  and  cloned,  two  isolated 
recombinant  colonies  were  picked  for  sequencing.  Sequences  of  two  clones 
derived  from  the  same  apparent  fragment  were  different,  showing  that  a  single 
gel  band  may  be  composed  of  more  than  one  cDNA  fragment.  This  has  been 
observed  by  others,  and  precludes  direct  sequencing  of  DNA  isolated  from  the  gel 
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(Bauer,  et  al.,  1993).  To  sequence  the  differentially  expressed  cDNA,  all  of  the 
cDNAs  in  a  fragment  must  be  cloned,  and  the  differentially  expressed  clone  must 
be  distinguished  from  contaminating  sequences.  This  was  accomplished  by 
producing  DNA  or  RNA  using  different  cDNA  clones  as  template  then 
hybridizing  this  to  Northern  blots  containing  mRNA  isolated  from  cells  exposed  to 
different  TNT  concentrations.  The  induced  sequences  were  readily  apparent. 

Once  a  single  TNT-responsive  cDNA  clone  is  isolated,  its  sequence  can  be 
used  to  design  specific  primers  for  PCR  amplification,  and  conventional  RT-PCR 
can  be  used  to  detect  this  transcript.  The  specific  clone  can  be  screened  further  for 
TNT  specificity  by  probing  RNA  from  cells  treated  with  different  stress-inducing 
agents.  These  agents  can  be  chemically  related  or  unrelated  to  TNT.  Moreover, 
complete  cDNA  and  genomic  clones  can  be  isolated  from  libraries  for  further 
characterization.  An  array  of  highly  specific  TNT-induced  cDNA  probes  can  then 
be  developed  into  a  molecular  bioassay  for  detecting  TNT-induced  changes  in 
plant  gene  expression.  This  may  allow  detection  of  very  small  amounts  of  TNT  in 
soils.  Messenger  RNA  markers,  especially  those  that  accumulate  in 
easily-sampled  portions  of  the  plant,  may  be  detected  using  sensitive  RT-PCR 
assays.  If  other  specific  molecular  probes  can  be  found  for  other  chemicals,  it 
may  be  possible  to  simultaneously  detect  different  pollutants  using  tissue  from  a 
single  plant. 


CONCLUSIONS 

The  results  presented  to  date  strongly  support  our  hypothesis  that  plants 
can  be  used  as  environmental  sentries  to  monitor  soil  and  water  for  selected 
contaminants.  In  this  study  we  have  focused  on  TNT  contamination.  However, 
in  principle,  such  an  approach  could  be  used  to  detect  any  contaminant  that  is 
water  soluble,  taken  up  by  plants  and  has  a  specific  impact  on  plant  gene 
expression. 

Considerably  more  research  is  required  to  generate  more  TNT-modulated 
gene  probes  and  demonstrate  that  these  are  TNT-specific,  or  at  least  specific 
enough  so  that,  when  used  with  a  battery  of  different  probes,  a  TNT-specific 
signature  of  gene  expression  is  produced.  While  early  efforts  focused  on 
identifying  TNT-induced  gene  fragments.  We  have  also  identified  a  number  of 
fragments  that  represent  genes  whose  expression  appears  to  be  suppressed  by 
TNT-exposure.  In  the  future,  we  will  continue  testing  probes  to  determine  their 
specificity  and  isolating  the  intact  genes  (and  their  regulatory  elements). 

Transfer  of  this  technology  to  the  field  requires  demonstration  that  what  is 
found  in  cell  cultures  and  hydroponically  grown  plants  also  occurs  in  plants 
growing  in  their  native  environment.  To  demonstrate  this,  we  are  establishing  a 
collaboration  with  The  Southwest  Wetlands  Group,  Inc.,  a  small  company  in 
Santa  Fe,  New  Mexico.  They  have  independently  demonstrated  that  a  septage 
driven  system  of  constructed  ecologies  can  decontaminate  soil  and  effluent 
contaminated  with  munitions  and  certain  other  organic  compounds.  Optimal 
use  of  their  system  requires  continuous  monitoring  of  TNT  concentrations  at 
different  points  in  the  treatment  cycle.  We  hope  to  demonstrate  whether  probes 
for  TNT-induced  genes  from  D.  innoxia  can  be  used  to  monitor  these  same  genes 
in  selected  plants  present  in  their  different  constructed  ecologies.  Such  a  method 
of  monitoring  would  provide  a  rapid  method  of  determining  what  factors  are 
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responsible  for  optimizing  the  treatment  process.  These  experiments  will  also 

demonstrate  whether  such  an  approach  can  be  used  in  practical  field 

applications.  At  this  point,  the  results  are  extremely  promising. 
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Appendix  A: 

BLASTN  Nucleic  Acid  Sequence  Search  Results. 
List  only  the  five  highest  scoring  matches. 


Query*  94-0530a20.Seq  -20.5  sequence  of  clone  "A*  (AP21.1) 

(170  letters,  both  strands) 

Database:  Non-redundant  FDB+GBupdate+GenBank+EMBLupdate+EMBL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 

Poisson 

High  Probability 


Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

emb 1X14597 IPHCHSJ  P. hybrids  chsJ  gene  for  chalcone  synthase  115  0.98  1 
emb 1X76558 I5PRAD26  S.pombe  rad26  gene.  112  0.9994  1 
emb 1X64587 IMMU2AF  M.musculus  mRNA  for  splicing  factor  U2 . . .  Ill  0.9999  1 


Query*  94-0630120. Seq  -20.5  sequence  of  clone  -L*  (AP26.2) 

(264  letters,  both  strands) 

Database:  Non-redundant  PDB+GBupdate+GenBank4EMBLupdate4EMEL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 

Poisson 

High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

***  NONE  *** 


Query*  94-0630m20.Seq  -20.5  sequence  of  clone  *M#  (AP26.2) 

(229  letters,  both  strands) 

Database:  Non-redundant  PDB4GBupdate4GenBank4EMBLupdate4EMBL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 
/  Poisson 
High  Probability 


Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

gb IM74100 ITOBCEP527A  Nicotiana  sylvestris  (clone  6PCEP52-...  194  l.le-06  1 
gbl J05507 IATHUXP1A  A.thaliana  ubiguitin  extension  prote...  128  0.39  1 
emb 1 235369 IATTS3806  A.  thaliana  transcribed  sequence;  cl...  127  0.39  1 
emblZ34610IATTS3370  A.  thaliana  transcribed  sequence;  cl...  127  0.41  1 
gbl J05508 IATHUXP2A  A.thaliana  ubiquitin  extension  prote...  127  0.44  1 


Query*  94-0530n20 .Seq  -20.5  sequence  of  clone  *N*  (AP26.3) 
(215  letters,  both  strands) 


Database:  Non-redundant  PDB4GBupdate4GenBank4EMBLupdate4EMBL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 

Poisson 

High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

***  NONE  ••• 


Query*  94-0530o20 .Seq  -20.5  sequence  of  clone  *0*  (AP26.3) 

(242  letters,  both  strands) 

Database:  Non-redundant  PDB4GBupdate4GenBank4EMBLupdate4EMBL,  5:16  AM  EDT  Sep 
9.  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 

Poisson 

High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

emb 1 234801 ICEF59A2  Caenorhabditis  elegans  cosmid  F59A2  127  0.48  1 
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Query*  94-0630p20 .Seq  -20.5  sequence  of  clone  *P*  (AP26.4) 

Database:  Non-redundant  PDB+GBupdate+GenBank+EMBLupdate+EMBL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 


Sequences  producing  High-scoring  Segment  Pairs: 

emb 1X56603 IMMPCBP  M.musculus  mRNA  for  calcium-binding  p... 
gb IM92988 IMUSERP60A  Mouse  calregulin  (ERp60)  mRNA,  comple... 
emblX14926 IMMCRP55  Mouse  mRNA  for  calreticulin 

Query*  94-0630q20 .Seq  -20.5  sequence  of  clone  *Q-  (AP26.4) 
(229  letters,  both  strands) 


9,  1994 

203,796  sequences;  207,906,619  total  letters. 


Sequences  producing  High-scoring  Segment  Pairs; 

gb IM74100 ITOBCEP527A  Nicotiana  sylvestris  (clone  6PCEP52- . . 
gb l J05507  t ATHUXP1A  A.thaliana  ubiquitin  extension  prote.. 
emb I Z3 53 6 9  I ATTS3808  A.  thaliana  transcribed  sequence;  cl.. 
emblZ34610 I ATTS3370  A.  thaliana  transcribed  sequence;  cl., 
gb 1 J05508 I ATHUXP2A  A.thaliana  ubiquitin  extension  prote.. 

Query*  94-0530e20 .Seq  -20.5  sequence  of  clone  *E*  (AP29.1) 
(184  letters,  both  strands) 


High 

Score 


Smallest 
Poisson 
Probability 
P(N)  N 


115 

115 

115 

0.998 

0.999 

0.999 

1 

1 

1 

,,  5:16 

AM  EDT  Sep 

Smallest 

Poisson 

High 

Probability 

Score 

P(N) 

N 

194 

l.le-06 

1 

128 

0.38 

1 

127 

0.39 

1 

127 

0.41 

1 

127 

0.44 

1 

Database:  Non-redundant  PDE+GBupdate4GenBank+EMBLupdate+EMBL,  5:16  AM  EDT  Sep 

9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 

Poisson 

High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Score  P(N)  N 

gb IU08931 INTU0893 1  Nicotiana  tabacum  SRI  seed  coat-specif . . .  122  0\69  1 

gb I L12467 1YSCGCN1A  Saccharomyces  cerevisiae  translational...  115  0.991  ^  ^1 

Query*  94-0530g20 .Seq  -20.5  sequence  of  clone  *G*  (AP29.2) 

(213  letters,  both  strands) 


Database:  Non-redundant  PDB* GBupdat e+Gen£ank* EMBLupda t e* EMBL ,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 


Sequences  producing  High-scoring  Segment  Pairs: 

gb I S74758 I S74758  mas5=heat  shock  protein  [Saccharomyc . 
emb 1X56560  I SCYDJ1  Yeast  YDJ1  gene  for  a  protein  homolo. 
emb 1X77783  I SCACCOAST  S. cerevisiae  gene  for  acyl-CoA  synth. 
gb IU02878 IRPU02878  Rickettsia  prowa2ekii  Madrid  E  trans. 
dbj ID14877 ICLOORF1  Clostridium  perfringens  orf,  virR  an. 


High 

Poisson 

Probability 

Score 

P(N) 

N 

133 

0.16 

1 

133 

0.16 

1 

119 

0.93 

1 

117 

0.98 

1 

114 

0.9991 

1 

Query*  94-0630i20 .Seq  -20.5  sequence  of  clone  “I*  (AP29.3) 
(190  letters,  both  strands) 


Database:  Non-redundant  PDB+GBupdate+GenBank+EMBLupdate+EMBL,  5:16  AM  EDT  Sep 
9,  1994 

203,796  sequences;  207,906,619  total  letters. 

Smallest 


Sequences  producing  High-scoring  Segment  Pairs: 

gb IU00431 IU00431  Mus  musculus  HMG-1  mRNA,  complete  cds. 
emb I Y0 03 65  I CHHMG1  Chinese  hamster  HMG-1  gene  for  high  .. 
emb I Z11997 IMMNHCHMG1  M.musculus  mRNA  for  non-histone  chro. . 
gb! J03998 IPFAGAR  P. falciparum  glutamic  acid-rich  prot.. 
emblX13703 IDDPDG1  Dictyostelium  plasmid  pDGl  DNA 


Poisson 


High 

Probability 

Score 

P(N) 

N 

160 

0.00068 

1 

160 

0.00070 

1 

160 

0.00070 

1 

160 

0.00070 

1 

151 

0.0042 

1 

18 


Appendix  B: 

BLASTX  Peptide  Sequence  Search  Results. 
List  only  the  five  highest  scoring  matches. 


Query=  94-0530a20. Seq  -20.5  sequence  of  clone  *A*  (AP21.1) 
(170  letters) 


Translating  both  strands  of  query  sequence  in  all  6  reading  frames 


Database:  Non-redundant  PDB*SwissProt4SPupdate+PIR+GenPept+GPupdate,  5:02  AM 
EDT  Sep  9,  1994 

127,124  sequences;  36,206,892  total  letters. 

Smallest 

Poisson 

Heading  High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Frame  Score  P(N)  N 


sp I P33042 1VL03_VARV 
sp I P29663 ICYB_AMICA 
Sp I P34869 ICYB_HETFR 
sp I P3  4  87 1 1 CYB^ISUPA 
gp IU06488 I PRU064  88_1 


PROTEIN  L3 . >pir II36B44II36844  M 
CYTOCHROME  B  (EC  1.10.2.2)  (FRA 
CYTOCHROME  B  (EC  1 .10.2 .2) .>gp I 
CYTOCHROME  B  (EC  1 . 10 .2 .2 ) . >gp I 
cytochrome  b  (Mitochondrion  Poe 


+ 1 
-3 
-3 
-3 
-3 


41  0.13 

39  0.40 

37  0.44 

37  0.44 

38  0.48 


2 

2 

2 

2 

2 


Query=  94-0630120. Seq  -20.5  sequence  of  clone  (AP26.2) 

(264  letters) 


Translating  both  strands  of  query  sequence  in  all  6  reading  frames 


Database:  Non-redundant  PDB+SwissProt+SPupdate+PIR+GenPept+GPupdate,  5:02  AM 
EDT  Sep  9,  1994 

127,124  sequences?  36,206,892  total  letters. 

Smallest 

Poisson 

Reading  High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Frame  Score  P(N)  N 


pir 1A45560 I A45560  SPAG-l=sporozoite  surface  antig...  -2 
sp |P35499ICINS_HUMAN  SODIUM  CHANNEL  PROTEIN,  SKELETA. . .  42 
gp t L04236 IHUMSCN4A21_ 1  skeletal  muscle  sodium  channel  ...  42 
gp IM81758 IHUMSKM1A_1  sodium  channel  alpha  subunit  [H...  42 
pir IS27809 1S27809  GTPase-activating  protein  -  fru...  -2 


43  0.0015  4 

41  0.022  3 

41  0.022  3 

41  0.022  3 

48  0.029  2 


Query=  94-0630m20 .Seq  -20.5  sequence  of  clone  *M*  (AP26.2) 
(229  letters) 


Translating  both  strands  of  query  sequence  in  all  6  reading  frames 


Database:  Non-redundant  PDB4SwissProt4SPupdate4plR4GenPept4GPupdate,  5:02  AM 

EDT  Sep  9,  1994 

127,124  sequences;  36,206,892  total  letters. 

Smallest 

Poisson 

Reading  High  Probability 

Sequences  producing  High-scoring  Segment  Pairs:  Frame  Score  P(N)  N 


sp  !  P 1 1 0 0  0  I  WAPA_STRMU 
pir IPN0528 IPN0528 
pir IA60095 i A60095 
pir I JL0142 I JL0142 
sp I PI 62 8 4 IPEC1_HUMAN 
*-******•***«♦***•*•»' 


WALL-ASSOCIATED  PROTEIN  PRECURS. 
G-protein  coupled  receptor  type, 
larval  glue  protein  1  precursor, 
leukocyte  antigen  CD31  precurso. 
PLATELET  ENDOTHELIAL  CELL  ADHES . 


43 

-2 

43 

43 

43 


44  0.062 

57  0.085 

40  0.13 

58  0.20 

58  0.20 


Query=  94-0530n20.Seq  -20.5  sequence  of  clone  *N*  (AP26.3) 
(215  letters) 


Translating  both  strands  of  query  sequence  in  all  6  reading  frames 


Database:  Non-redundant  PDB4SwissProt4SPupdate4pIR4GenPept4GPupdate,  5:02  AM 

EDT  Sep  9,  1994 

127,124  sequences;  36,206,892  total  letters. 

Smallest 

Poisson 


Sequences  producing  High-scoring  Segment  Pairs: 


Reading  High 
Frame  Score 


Probabilir 

P(N) 


gplL10910|HUMHCCAJL 
gp 1 L10911 !HUMHCCB_1 
sp I P17132 IROC_RAT 
gp I L3  5  8  9  9  I DROAPRO J_1 
gp IU11274 IMMU11274_1 


splicing  factor  (Homo  sapiens)  -3 
splicing  factor  (Homo  sapiens)  -3 
HETEROGENEOUS  RIBONUCLEOPROTEIN . . .  -1 
Drosophila  melanogaster  project...  -3 
A4U-rich  RNA-binding  protein  (M...  -1 


42  0.070 

42  0.071 

42  0.12 

62  0.17 

42  0.23 
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V 


Query=  ^"°5^°°20.Seq  -20.5  sequence  of  clone  *0-  (AP26.3) 

Translating  both  strands  of  query  sequence  In  all  6  reading  frames 
O.Ol»S.,  S,02  A» 

127,124  sequences;  36,206,892  total  letters. 

Smallest 

Poisson 

Sequences  producing  High-scoring  Segment  Pairs:  s58  billtV 


S 1X16469 ISSSSSS  fgbeirSrgene  ‘^sicSTilo 

K£  8  a&a 

. conglutin  gamma  precursor  -  narr. 


Reading  High 
Frame  Score 


2 . 0e-16 
1.7e-05 
1.7e-05 
1.7e-0S 
0.0027 


Cuery=  j^^^P^O^Seq  -20.5  sequence  of  clone  *P*  (AP26.4) 

Translating  both  strands  of  query  sequence  in  all  6  reading  frames 

Database:  Non-redundan^PDB.SwissProt.SPupdate.PlR^enPepfGPupdate,  5:02  AH 

127,124  sequences;  36,206,892  total  letters. 

Smallest 

Poisson 

Sequences  producing  High-scoring  Segment  Pairs:  Score  PIN^^^N 

lilillllllilllil?  ^^Si'SSL'SSS::: :i  8 

S  S1J  E-SSSSr:::  :l  8  {•“ 

f  *  *  n  * 3 i * ? • fTf i?*  *  ^OUANINE  NUCLEOTIDE-BINDING  PROTEI . . .  +3  41  o!l£ 


Sequences  producing  High-scoring  Segment  Pairs: 


0.012 

0.022 

0.15 

0.15 

0.15 


*****♦«•»**«•  ******** ****;v7*virrrr  .. .  41  0.15 

Ouerv-  94  1  11  1  . * . . . 

Q  (2296letterstQ  ’2°*5  sequence  of  cIone  *Q*  (AP26.4) 

Translating  both  strands  of  query  sequence  in  all  6  reading  frames 
Database:  Non-red undant^PDB.SvissProt.SPupdate.PlR.GenPept.GPupdate,  5:02  AM 
127,124  sequences;  36,206,892  total  letters. 


Sequences  producing  High-scoring  Segment  Pairs: 


Smallest 

Poisson 


Frame 

Score 

P(N) 

-  -  '-j 

N 

. .  43 

47 

0.022 

2 

..  +3 

42 

0.022 

2 

.  .  4l 

37 

0.052 

3 

4  1 

..  43 

41 

39 

0.096 

0.30 

2 

2 

Query*  **~0530e20 -s*9  -20.5  sequence  of  clone  *E*  (AF29.1) 
no*  letters) 

Translating  both  strands  of  query  sequence  in  all  6  reading  frames 

Database:  Non-^undantjDB.SwissProt.SPupdate.PlR.GenPept.GPupdate,  5:02  AM 

127,124  sequences;  36.206,892  total  letters. 


Sequences  producing  High-scoring  Segment  Pairs: 

?  igggf-BPB_1  gamma FBP-B  [Gallus  gallus] 
gp  1X79051  IGGGFBPA_1  gammaFBP-A  [Gallus  aAn..ei 
gplX79011 |GGGFBPC_1  gammaFBP-C  [Gallus  aallusl 
pir IS07571 IS07571  twitchin  - 


Smallest 
„  Poisson 

Heading  High  Probability 
Frame  Score  p<N)  N 


:1  «  0.36  2 


41  0.31 

41  0.32 

41  0.33 
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°“'rv'  AViasrA^ay:; 

JT7  *“!  °f  wry  *,,u'nce  in  *u  •  *— 


c  sum 

Reading  High  Probability 
Frame  Score  p(N)  N 


Sequences  producing  High-scoring  Segment  Pairs: 

SP IM15816 ISYNT0X1  1  Synthetic  air,*.*  < 

*P I P27031 IRHAA  SALTY  L-RHAMNO^tS^bJ®  t0X  ®ene  DNA< 

SP IZ23277  ICDTOXBA  1  taSrvfSi  280M?5ME  (BC  5.3.1.14. . . 
SPIP12526IMAS_RAT  MAS^ROTO  ONr^^"’  difficile) 

sp  I P04201 IMAS  HUMAN  MAS  PROto‘™2£™£‘  IA31816  IT. . . 

. >Pir  I A01375  IT. . . 

~w-  mjtir  -20-5  — U 

both  ot  wty  in  4JJ  6  tMaiw  ^ 

SfsSTs.  rDs*s“i*sr”'*s^«..pJR.0e„P.pt.SPupa,t./  5 1 

127,124  sequences:  36,206.892  total  letters. 
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42 

-3 

-3 
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65 

65 

56 

52 

52 


0 . 088 

0.58 

0.65 

0.97 

0.97 


1 

a 

3 

2 

2 
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Sequences  producing  High-scoring  Segment  Pairs: 


SPIL13744 IHUMAF9X  1 
SPIU02548IAAU02548  1 
SPIU00691IU00691  4~ 
sp IP17671 IE75A  DROME 
sp I P17672 IE75B_DROME 


Reading  Hig: 
Frame  S cor 


KWf«E-n«c:BLE_raSS5  Sir::: 


********1 


Smallest 
Poisson 
Probability 
P(N)  n 

l.Se-09  2 

5 . 0e-06  2 
6 .3e-06  3 

1.4e-05  2 

1 . 6e-05  2 
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Appendix  C: 

Complete  cDNA  Sequence  of  Differential  Display  Products. 

>  -20.5  sequence  of  clone  “A*  (AP21.1)  94-0530a20 .Seq 

CGAGTATGAGCCCAGATCTAGGGGGACTTTAACAGCAACGTGTAAACAAATATGGAACATTCAAGAAAAGGAAATGCTCWAACCCAG 

GGAGATCAGAAACATATTACACAACATATOATAAAATGTGTTTCTTCAGTTCTCTTCATTCACATCATACTCCTCATACTCG 

>  -20.5  sequence  of  clone  (AP26.2)  94-0630120. Seq 

GAAGCTGAGGCAATIGTTACCAATCATTTGGTTGTACGAGGCAGCTATCGAAGCCTTAGCTTGGTTGTGTATGGAAACACCACTGAAG 

ATCTTGGGCAATrCAACATAGATGTCGACTTAGATGGCTCTTTGGCTAATACTGTTAGTGTTGTTGAGGGAGACCTTGAAGACCTwCC 

ACCTGCACTGCGTCCAAATAATCTATCTACTGAGCAAACArrGTCTTCCTTGAAATCATTATCTTTTGAAAAGTATTCCCTCAGCTTC 

>  -20. S  sequence  of  clone  *M*  (AP26.2)  94-0630m20.Seq 

GAAGCTGAGGGTTCACAAATGGATATTAGAATAAGTCAGGCTAGTGTAAGCTTGCACTTATCCGATAATATTGACGTTTTAAGACAAC 

TACTAGTCTATCCACTACAGCAATTTGGAGCAAAACAGAAAACTGTAAGACCACAGACTGGCAGTATAACTGCAATTCACACAATTGA 

TCTACTAGAAAGGATGTCATGCCTACTTGATCTTCTrCTTTGGCCTCAGCTTC 

>  -20. S  sequence  of  clone  *N*  (AP26.3)  94-0530n20.Seq 

GAAGCTGAGGATTAACGGAAGGAAGATTCAAGCCACTGCAACAGTGCCATTTCGTACAGCTTTATTTGCATCAGTTTCTTTAGAGGGG 

GTCTCTGTTTITGTTCTTGAGGATCTCTTTGATGCAGAAATTTTCCA'J’i’HVTTTCANTAGNTTCGNCCAKTTCTATCTGTTCTTG-C 

TGCATTCTTGGCTACAAAATGGTGTATCCCCTCAGCTTC 

>  -20.5  sequence  of  clone  *0*  (AP26.3)  94-0530o20.Seq 

GAAGCTGAGGGGTTAAGGGTATGGCTGGTCTTGGAAGAACAAGAATATCTCTTCCTTCTCAATTCrCAGCTGAATTTAGTTTCCCTAG 

ATTTTCAAATGATGATTTTTCCTACACTCCACTTTTTATCAATCCAGTTAGCACAGNCTCAGCTTCAATCACTAGTGCGGGCGCCTGC 

AGGTCGACCATATGGGAGAGCTCCCAACGCGTTGGATGCATAGCTTGAGTATTCTATAGTGTCAACG 

>  -20.5  sequence  of  clone  *P‘  (AP26.4)  94-0630p20 .Seq 

SSSS3S SSS=‘SSS« 

GAAGCCAGCAAAGCAGAGGACCATTTTTGGGTTCCACCCCACCACAACTCCCCACCCCTCAGTTTCANTCACTAGTGCGGTCGCCTGC 

AGGTCG ACCATATGGG AG AGCTCCC AN  CGCGTTGG  C 

>  -20.5  sequence  of  clone  *Q*  (AP26.4)  94-0630q20 .Seq 

GAAGCTGAGGGTTCACAAATGGATATTAGAATAAGTCACGCTAGTGTAAGCTTGCACTTATCCGATAATATTGACGTTTTAAGACAAC 

TACTAGTCTATCCACTACAGNAATTTGNAGCAAAACAGAAAACTGTAAGACTACAGACTGGCAGTATAACTGCAATTCACACAnTTGA 

TCTACTAGAAAGGATGTCATGCCTACTTGATCTTCTTCTTTGGCCTCAGCTTC 

>  -20.5  sequence  of  clone  ‘E*  (AP29.1)  94-0530e20.Seq 


GGCATTCC 

>  -20.5  sequence  of  clone  *G*  (AP29.2)  94-0530g20.Seq 

GGAATGCCATGGCCACATCAAAGAAAGTAATCACTAGAGATAATTGGGAGAAAAGGCTCAATGATGTAAAGATTAGAAAAGAAGATAC 

AGATAGATTGGTAATGAATTTTCTAGTTACAGAGGGTTATGTAGAAGCTGCAGAAAAGTTCAGAATAGAATCTGGGACTGATCCTGA. 

ATGGATCTTGCTACCATCACGGATCGGATGGCATTCC 

>  -20.5  sequence  of  clone  *1*  (AP29.3)  94-0630i20.Seq 

GGAATGCCATATACTACTTCTAGTTAAACCAATTTGNAATGATCAAAATTTACATAAGCTTAGAGAACTAGTCATTTCAAAAATCTAC 

ACCAAAAGCTCTATTCATCATCTACTCCTATCTCATCTTTCACTTCCTCATCTACTGCTATCTCTTTCTCTTCCTTCTCATCATCTTC 

ATCAATGGCATTCC 
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